Embedded state metric storage for MAP decoder of turbo codes

ABSTRACT

A method, an embedded state metric storage, is used for MAP (Maximum A Posterior)-based decoder of turbo codes to reduce the memory requirement of state metric storage. For MAP decoder, this method comprises selecting any state metric from the updated state metrics for each recursion direction, forward and reverse, and dividing the state metrics by the selected state metric; the selected state metric value becomes a constant, namely, one. The constant one state metric is embedded into the resulted state metrics. For log-MAP decoder, this method comprises selecting any state metric from the updated state metrics in each direction, forward and reverse, and subtracting the state metrics from the selected state metric; the selected state metric value becomes a constant, zero. The constant zero state metric is embedded into the resulted state metrics. One advantage of the embedded state metric storage during state metric updating and likelihood ratio calculation is to embed the information of the selected state metric into the resulted state metrics. Thus, the selected state metric is not required to be kept in the state metric memory, and calculation of a constant state metric in the resulted state metric can be omitted. Therefore, the latency and the area of implementation in ASIC will be reduced with this method of embedded state metric storage.

FIELD OF THE INVENTION

The present invention relates to implementation of MAP (Maximum A Posterior)-based algorithm. More specifically, the present invention reduces the memory requirement for state metric storage in MAP-based decoders by embedding the selected state metric information into the state metrics.

BACKGROUND OF THE INVENTION

Reliable data transmission is very important for wireless communication systems. Turbo codes, originally described by Berrou et al. “Near Shannon limit error-correcting coding and decoding Turbo codes,” Proc. 1993 Int. Conf. on Comm., pp. 1064-1070, which is incorporated herein by reference, can achieve a channel capacity near the Shannon Limit with sufficient Signal to Noise Ratio (SNR); also see U.S. Pat. No. 6,598,204 B1, “System and Method of Turbo Decoding” to Giese et al., issued on Jul. 22, 2003, and U.S. Pat. No. 6,516,444 B1, “Turbo-Code decoder” to Maru, issued on Feb. 4, 2003, each of which is incorporated herein by reference.

Decoding of Turbo codes is performed in an iterative way: the information processed by one decoder is fed into another decoder iteratively until a certain degree of convergence is achieved. Decoding within each decoder can be performed in several ways. Maximum A Posteriori (MAP) decoding based on the BCJR algorithm, proposed by Bahl et al., “Optimal decoding of linear codes for minimizing symbol error rate,” IEEE Trans. On Inf. Theory, pp. 284-287, March 1974, which is incorporated herein by reference, is widely used. BCJR decoding is done according to a trellis chart, by mapping the status of convolutional memory at time k to its states at time k. As a posteriori probability (APP) is used as the probability metric for the trellis, the BCJR algorithm is generally referred to as the Maximum A Posteriori method.

In practice, the a posteriori probability may be computed in terms of the log domain value, in order to reduce complexity. Such an algorithm is referred to as the log-MAP algorithm. In an effort to reduce the decoding complexity of the BCJR algorithm, the log-MAP algorithm utilizes a suboptimum realization of the BCJR algorithm, using log-likelihood ratios and some approximations to avoid calculating the actual probabilities, and simplify some computations, for example by omitting some insignificant computations in the BCJR algorithm to reduce the complexity. These approximation algorithms indeed have smaller complexity than the BCJR algorithm, though, their error performance is not as good as that of the BCJR algorithm. Furthermore, the construction of the entire trellis diagram is still necessary in these methods.

A further simplified log-MAP algorithm is referred to as the Max log-MAP algorithm. In log-MAP algorithm, a log summation of exponentiation operations, ln ${\sum\limits_{j}{\mathbb{e}}^{j}},$ is calculated during extrinsic information update. The exponentiation enhances the differences between individual values in the log summation. Hence, one term will dominate the log sum, which suggests the approximation ${\ln{\sum\limits_{j}{\mathbb{e}}^{a_{j}}}} \approx {\max\limits_{j}{\left( a_{j} \right).}}$ The log-MAP algorithm with this approximation is referred to as the Max log-Map algorithm. Thus, Max log-Map is much simpler than log-MAP, but at the expense of performance degradation.

The objective of the MAP algorithm is to obtain a good guess of bit information probabilities. These probabilities include systematic information, intrinsic information and extrinsic information. The extrinsic information is used by another decoder in the next iteration. After a certain number of iterations, the result of extrinsic information converges and the iterative process stops. The performance of Turbo decoder based on the MAP algorithm is close to the Shannon limit.

Although the performance of Turbo codes is near optimal, integrated circuit implementation of the MAP algorithm faces two main challenges: latency and memory requirement. Upon receiving a data frame, the MAP decoder works in an iterative way. In each iteration, the MAP decoder accesses the data frame and the extrinsic information generated in the previous iteration from head to tail (forward recursion) and then from tail to head (backward recursion) to collect decoding information. Based on the collected information, the decoder estimates the most likely input data. The estimation of extrinsic information is fed back to another decoder in the next iteration. This means that, for each iteration, the MAP decoder must process the data from the beginning of the data frame to the end, and then process in the reverse direction before the estimation of the extrinsic information will be made. For a data frame of n bits, the process of getting the extrinsic information takes 2n steps, and the estimation needs n steps. Hence, latency of the MAP algorithm is large. The MAP decoder has to keep all of the decoding information until the extrinsic information is generated, and the extrinsic information must be stored for the next iteration. For a data frame of n samples and a turbo code space of S, 2×n×S memory units are required to store the temporary information. For example, S=8, n=20730 in the turbo codes of the cdma2000 system, 331680 memory units are required for the MAP decoder.

It is desirable to provide a method to reduce the memory requirement and speed up the calculation in the MAP decoder.

SUMMARY OF THE INVENTION

The present invention relates to implementation of MAP-based algorithm in MAP-based turbo decoders. More specifically, in an illustrated embodiment, the present invention reduces the memory requirement for state metric storage in MAP-based decoders by embedding selected state metric information into the state metrics.

In a MAP or log-MAP decoder, it is important to calculate the relative probability values among state metrics, not the absolute probability values. A state metric with the largest value means the state is the correct state on the optimal decoding path with the highest probability. A state metric having a larger value provides more significance than one having smaller value during metric updating and extrinsic information calculation.

In the illustrated method, embedded state metric storage is used for a MAP decoder of Turbo codes to reduce the memory requirement for state metric storage. For a MAP turbo decoder, this method comprises selecting any state metric from the updated state metrics for each recursion direction, forward and reverse, and dividing the state metrics by the selected state metric; the selected state metric value becomes a constant, namely, one. The constant one state metric is embedded into the resulting state metrics.

The constant one state metric is defined as the embedded state metric.

For a log-MAP decoder, including its simplification a Max log-MAP decoder, this method comprises selecting any state metric from the updated state metrics in each direction, forward and reverse, and subtracting the state metrics from the selected state metric; the selected state metric value becomes a constant, zero. The constant zero state metric is embedded into the resulted state metrics. The constant zero state metric is defined as the embedded state metric. The advantage of embedded state metric storage during state metric updating and likelihood ratio calculation is to embed the information of the selected state metric into the resulting state metrics. Thus, the embedded (or selected) state metric is not required to be kept in the state metric memory, and calculation of a constant state metric in the resulting state metric can be omitted. Therefore, the latency and memory area utilization in this implementation will be reduced.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a simplified block diagram of the Turbo encoder and decoder on a digital communication system in which the present invention is used;

FIG. 2 is a simplified decoding path for the turbo code with a length=15 and S=8;

FIG. 3 is a simplified schematic block diagram of a representative, illustrative interconnection of resources detailing the operations of a MAP decoder using a known method;

FIG. 4 is a simplified schematic block diagram of a representative, illustrative interconnection of resources detailing the operations of MAP decoder in accordance with the present invention;

FIG. 5 is a diagram of the performance (i.e., Bit Error Rate (BER)) comparison of the MAP decoder with the present invention and with the known method on AWGN channel. The Turbo code used in the simulation is based on the cdma2000 standard with a Length=1146 and a code rate=1/3; and

FIG. 6 is a diagram of the performance comparison of the MAP decoder with the present invention and with the known method on AWGN channel. The Turbo code used in the simulation is based on the cdma2000 standard with a Length=3066 and a code rate=1/3.

DETAILED DESCRIPTION OF THE INVENTION

The following variables will be used in describing the present invention:

α_(k) is the state metric of the MAP decoder in the forward direction at the k-th step;

A_(k) is a natural log value of the state metric α_(k) of the MAP decoder in the forward direction at the k-th step;

β_(k) is the state metric of the MAP decoder in the reverse direction at the k-th step;

B_(k) is the natural log value of the state metric β_(k) of the MAP decoder in the reverse direction at the k-th step;

LLe_(k) is the extrinsic information of the MAP decoder for the k-th symbol;

LLR_(k) is the natural log value of the extrinsic information of the MAP decoder for the k-th symbol;

S is the collection of state space for the turbo codes;

m is the number of state registers in an encoder;

M (=2^(m)) is the number of discrete states in the collection state space S;

x_(k) is the k-th symbol on the decoding path;

y_(k) is the k-th parity code word on the decoding path;

ds_(k) is the k-th symbol fed into the decoder;

dp_(k) is the k-th parity code word fed into the decoder;

s⁺ is the sub-collection of the states transferred from s′ to s, (s′→s), when x_(k)=+1;

s⁻ is the sub-collection of the states transferred from s′ to s, (s′→s), when x_(k)=−1;

length is the length of the data frame for the decoder.

FIG. 1 is a simple block diagram representing a Turbo encoder and a Turbo decoder. The Turbo encoder 107 is formed by a combination of two simple encoders 109 and 110. Assuming a source data frame 101 with length information bits, the first encoder 109 receives as its input the information bits in its original from. The same length information bits are fed through an interleaver 111 that permutes the information bits before inputting them into the second encoder 110. The two encoders 109 and 110 generate parity symbols 103(y) and 104(y′) from two recursive convolutional codes, respectively. The parity symbols are used in the two recursive decoders at the receiver, respectively, to produce independent, extrinsic information to help decoding algorithm convergence. These encoded information bit streams, as well as source data 102(x) are then punctured by the puncturing mechanism 118 to save bandwidth. Puncturing is a selection scheme for parity symbols 103(y) and 104(y′) according to the coding rate.

The punctured information bit streams are transmitted through a noisy discrete memoryless channel 117, and received and de-punctured at a puncturing mechanism 119 at the decoder 108. In the decoder 108, a demodulator (not shown) sends soft information relating to the received bit stream, i.e., probabilities of the received values. These probabilities can be interpreted as containing the received information bit value and the associated confidence value, which indicates how likely this information bit is correct.

Firstly, the parity bit streams 105 a (ds) and 105 (dp) of the received information bits are fed to the first decoder 112. The first decoder 112 evaluates and combines the probabilities of the input bit streams to refine the soft information so that the confidence level of individual bits being correct is maximized, since the maximum a posteriori decoding based on the BCJR algorithm is used in the decoder. The refined probabilities are fed into the second decoder 115 along with the de-interleaved information bit stream and the second parities bit stream 106(dp′), again producing enhanced soft information. After a predetermined number of iterations, the decoding process is completed, and the soft values are available at the output. According to the study in “Near Shannon limit error-correcting coding and decoding Turbo codes,” Proc. 1993 Int. Conf. on Comm., pp. 1064-1070, which is incorporated herein by reference, the improvement of bit error rate (BER) worsens with the increase in the number of iteration. To simplify the implementation, the number of iterations is preferably selected as 7 or slightly bigger, although the invention is not limited to this range.

When data is exchanged between the two decoders 112 and 115, the soft values are reordered to match the interleaving structure. This is preformed by the interleaver 113 and de-interleaver 116. An interleaver is a device that rearranges the ordering of a sequence of symbols in some one to one deterministic manner. Associated with any interleaver is a de-interleaver, which is the device that restores the reordered sequence to its original ordering. The parity symbol information from two parallel recursive convolutional encoders is used in two decoders respectively. The extrinsic information from decoder I 112 must be interleaved to produce the same sequence order with the parity information generated by encoder II 110. The interleaved extrinsic information and source symbol information are fed into decoder II 115 with the parity information generated by encoder II 110 to perform the second decoding step. The extrinsic information from decoder II 115 is fed into de-interleaver 116 to restore its original ordering, which is the same ordering with source symbol information ds 104 and parity information dp 119 generated by encoder I 109. Thus, decoder I 112 can work correctly to calculate the extrinsic information for the next iteration.

FIG. 2 is a simplified decoding path for the Turbo code with a length=17 and S=8. In a MAP decoder, there are 8 states 202 corresponding to each step. Each state transits to two states of the next step corresponding to the input information bit being zero 203 and one 204, respectively. The MAP decoder evaluates and combines the probabilities of the received data 201 from head to tail to obtain forward state metrics α, and from tail to head to obtain reverse state metrics β. Then the MAP decoder uses the forward and reverse state metrics to refine the soft information, i.e., the MAP decoder finds out the optimal decoding path 205 from all possible paths. The optimal decoding path 205 is the best guess for all input data.

In the forward direction, the state metrics α_(k)=(α_(k)(0), α_(k)(1), . . . , α_(k)(M−1)) are obtained with α_(k−1)=(α_(k−1)(0), α_(k−1)(1), . . . , α_(k−1)(M−1)). In the reverse direction, the state metrics β_(k)=(β_(k)(0), β_(k)=(1), . . . , β_(k)(M−1)) are obtained with β_(k+1)=(β_(k−1)(0), β_(k−1)(1), . . . , β_(k−1)(M−1). According to the MAP algorithm, the state metrics represent the state transition probabilities from one step to the next step. If overflow occurs during state metrics updating due to the limited bit width of the state metrics in the implementation, errors are generated, and the errors will propagate inside the MAP decoder. Thus, the state metrics must be normalized by the term Σ_(s)Σ_(s′)α_(k−1)(s′)γ_(k)(s′,s). α_(k) and β_(k−1) are calculated as following $\begin{matrix} {{\alpha_{k}(s)} = \frac{\sum_{s^{\prime}}{{\alpha_{k - 1}\left( s^{\prime} \right)}{\gamma_{k}\left( {s^{\prime},s} \right)}}}{\sum_{s}{\sum_{s^{\prime}}{{\alpha_{k - 1}\left( s^{\prime} \right)}{\gamma_{k}\left( {s^{\prime},s} \right)}}}}} & (1) \\ {{\beta_{k - 1}\left( s^{\prime} \right)} = \frac{\sum_{s}{{\alpha_{k - 1}\left( s^{\prime} \right)}{\gamma_{k}\left( {s^{\prime},s} \right)}}}{\sum_{s}{\sum_{s^{\prime}}{{\alpha_{k - 1}\left( s^{\prime} \right)}{\gamma_{k}\left( {s^{\prime},s} \right)}}}}} & (2) \end{matrix}$ where sεS, s′εS

The state transition probability γ_(k)(s′,s), the branch metrics, is defined as $\begin{matrix} {{\gamma_{k}\left( {s^{\prime},s} \right)} = {p\left( {s_{k} = {{s/s_{k - 1}} = s^{\prime}}} \right)}} \\ {= {\exp\left\lbrack {{\frac{1}{2}{x_{k}\left( {{LLe}_{k} + {L_{c}{ds}_{k}^{s}}} \right)}} + {\frac{1}{2}L_{c}{dp}_{k}Y_{k}}} \right\rbrack}} \\ {= {{\exp\left\lbrack {\frac{1}{2}{x_{k}\left( {{LLe}_{k} + {L_{c}{ds}_{k}^{s}}} \right)}} \right\rbrack} \cdot {\gamma_{k}^{e}\left( {s^{\prime},s} \right)}}} \end{matrix}$ where ${\gamma_{k}^{e}\left( {s^{\prime},s} \right)} = {{\exp\left\lbrack {\frac{1}{2}L_{c}{dp}_{k}Y_{k}} \right\rbrack}.}$ The extrinsic information, soft information, is $\begin{matrix} {{LLe}_{k} = {\frac{\sum_{s^{+}}{{\alpha_{k - 1}\left( s^{\prime} \right)}{\gamma^{e}\left( {s^{\prime},s} \right)}{\beta_{k}(s)}}}{\sum_{s^{-}}{{\alpha_{k - 1}\left( s^{\prime} \right)}{\gamma^{e}\left( {s^{\prime},s} \right)}{\beta_{k}(s)}}}.}} & (3) \end{matrix}$

FIG. 3 is a simplified schematic block diagram of representative, illustrative interconnection of resources detailing the operations of the MAP decoder using a known method. The received data 302 and the extrinsic information of the previous iteration 308 are fed into the branch metric calculation unit 303 from the received data memory 301 and the extrinsic information of the previous iteration 314, respectively, to generate the branch metric γ_(k) ^(e) 304 for the likelihood ratio calculation, γ_(kf) 310 for the state metrics updating in the forward direction and γ_(kr) 309 for the state metrics updating in the reverse direction. The state metrics-updating unit in the forward direction 312 uses γ_(kf) 310 from the branch metric calculation unit 303 and the state metrics in the previous step α_(kf) 319 from the metric buffer/memory 316 to update the state metrics α_(kf+1) 318 in the forward direction. The state metrics-updating unit in the reverse direction 311 uses γ_(kr) 309 from the branch metric calculation unit 303 and the state metrics in the previous step β_(kr) 315 from the metric buffer/memory 316 to update the state metrics β_(kr−1) 317 in the reverse direction. When the branch metrics γ_(k) ^(e) 304 and the state metrics (α_(k−1),β_(k)) 313 in both recursion directions are available at the inputs of the likelihood ratio calculation unit 305, the likelihood ratio LLe(k) 306 is generated and stored in the extrinsic information buffer/memory for the next iteration.

In order to simplify the computation, the known log-MAP algorithm, such as has been shown in Robertson et al. “A comparison of optimal and suboptimal MAP decoding algorithms operation in the log domain,” Proc. 1995 Int. Conf. on Comm., pp. 1009-1013 and in Viterbi, “An intuitive justification and a simplified implementation of the MAP decoder for convolutional codes,” IEEE JSAC, pp. 260-264, February 1998, each of which is incorporated herein by reference, is employed in most implementations. The simplification is $\begin{matrix} {{{LLR}_{k} = \frac{\sum_{s^{+}}{{\alpha_{k - 1}\left( s^{\prime} \right)}{\gamma^{e}\left( {s^{\prime},s} \right)}{\beta_{k}(s)}}}{\sum_{s^{-}}{{\alpha_{k - 1}\left( s^{\prime} \right)}{\gamma^{e}\left( {s^{\prime},s} \right)}{\beta_{k}(s)}}}};} & (4) \end{matrix}$ where $\begin{matrix} {{\Gamma_{k}\left( {s^{\prime},s} \right)} = {\ln\left( {p\left( {s_{k} = {{s/s_{k - 1}} = s^{\prime}}} \right)} \right)}} \\ {= {{\frac{1}{2}{x_{k}\left( {{LLR}_{k} + {L_{c}{ds}_{k}^{s}}} \right)}} + {\frac{1}{2}L_{c}{dp}_{k}Y_{k}}}} \\ {= {{\frac{1}{2}{x_{k}\left( {{LLR}_{k} + {L_{c}{ds}_{k}^{s}}} \right)}} + {\Gamma_{k}^{e}\left( {s^{\prime},s} \right)}}} \end{matrix}$ where ${\Gamma_{k}^{e}\left( {s^{\prime},s} \right)} = {\frac{1}{2}L_{c}{dp}_{k}{Y_{k}.}}$ Jacobian equality can be used to simplify the exponential computation in log-MAP algorithm as follows: ln(e ^(a) +e ^(b))=Log_Sum(a,b)=max(a,b)+ln(1+exp(−|b−a|)  (5) For implementation, ln(1+exp(−|b−a|)) can be realized with a lookup table. According to previous studies, e.g., in Robertson et al., mentioned above, a lookup table size of 8 can provide enough accuracy.

The normalization in the log-MAP algorithm is modified as $\begin{matrix} {{A_{k}(s)} = {{A_{k}^{\prime}(s)} - {\max\limits_{s \in S}\left\{ {A_{k}^{\prime}(s)} \right\}}}} & (6) \\ {{{B_{k}(s)} = {{B_{k}^{\prime}(s)} - {\max\limits_{s \in S}\left\{ {B_{k}^{\prime}(s)} \right\}}}}{where}} & (7) \\ {{{A_{k}^{\prime}(s)} = {{LOG}\underset{s^{\prime} \in S}{\_}{SUM}\left\{ {F_{k - 1}(s)} \right\}}},{{F_{k - 1}(s)} = {{A_{k - 1}\left( s^{\prime} \right)} + {\Gamma_{k}\left( {s^{\prime},s} \right)}}}} & \left( {6a} \right) \\ {{{B_{k - 1}^{\prime}\left( s^{\prime} \right)} = {{LOG}\underset{s \in S}{\_}{SUM}\left\{ {R_{k}\left( s^{\prime} \right)} \right\}}},{{R_{k}\left( s^{\prime} \right)} = {{B_{k}(s)} + {\Gamma_{k}\left( {s^{\prime},s} \right)}}}} & \left( {7a} \right) \end{matrix}$

In log-MAP algorithm, extrinsic information is $\begin{matrix} {{{LLR}_{k} = {{{Log\_}\underset{s \in {S +}}{Sum}\left\{ {P_{k}(s)} \right\}} - {{Log\_}\underset{s \in {S -}}{Sum}\left\{ {P_{k}(s)} \right\}}}}{where}{{P_{k}(s)} = {{A_{k - 1}\left( s^{\prime} \right)} + {\Gamma_{k}^{e}\left( {s^{\prime},s} \right)} + {{B_{k}(s)}.}}}} & (8) \end{matrix}$ where P_(k)(s)=A_(k−1)(s′)+Γ_(k) ^(e)(s′,s)+B_(k)(s).

According to the idea of MAP and log MAP algorithms, state metrics at each step consists of a set of numbers limited within upper and lower boundaries. A state metric with the largest value means the state is the correct state on the optimal decoding path with the highest probability. If A_(k)(s) is the maximum, s is the correct state with the highest probability at the k-th step in the forward direction. If B_(k)(s) is the maximum, s is the correct state with the highest probability at the k-th step in the reverse direction.

In equations (3) and (8), the maximum of {α_(k−1)(s′)}/{A_(k−1)(s′)} and the maximum of {β_(k)(s)}/{B_(k)(s)} give the main contribution to LLe_(k) and LLR_(k). If ds_(k)=+1 transmits at the output of the encoder and the estimate of the decoder is correct, LLe_(k)/LLR_(k) is close to the upper boundary. If ds_(k)=−1 transmits at the output of the encoder and the estimate of the decoder is correct, LLe_(k)/LLR_(k) is close to the lower boundary. If the value of LLe_(k)/LLe_(k) is closer to the upper or lower boundary, the confidence of the estimate is higher at the k-th step. If the current maximum metric is much larger than the previous maximum metric, LLe_(k)/LLe_(k) will converge at the correct estimate sooner. Thus, it is important to know which state has the relative maximum, rather than the absolute maximum.

For a MAP algorithm in Turbo decoding, including the Max log-MAP algorithm, any one of the updated state metrics from equations (1) and (2) of each recursion direction can be selected as the embedded metric, for example, α′_(k)(0) and β′_(k−1)(0) are selected as the embedded metrics for the forward and reverse directions, respectively. The embedding procedure is as follows: α_(k)(s)=α′_(k)(s)/α′_(k)(0), sεS  (9); β_(k−1)(s)=β′_(k−1)(s)/β′_(k−)(0), sεS  (10).

For a log-MAP algorithm in Turbo decoding, any one of the updated state metrics from equations (6) and (7) of each recursion direction can be selected as the embedded metric, for example, A′_(k)(0) and B′_(k−1)(0) are selected as the embedded metrics for the forward and reverse directions, respectively. The embedding procedure is as follows: A _(k)(s)=A′ _(k)(s)−A′ _(k)(0), sεS  (11); B _(k−1)(s)=B′ _(k−1)(s)−B′ _(k−1)(0), sεS  (2). After the embedding procedure of MAP and log-MAP, the original relationship among the updated state metrics is maintained in the new set of state metrics due to the linear operations of division and subtraction. The embedded metric is always equal to a constant, i.e., one for a MAP decoder and zero for log a MAP decoder. Thus, {α(0)}={α₀(0), α₁(0), . . . , α_(length)(0)}/{A(0)}={A₀(0), A₁(0), . . . , A_(length)(0)} and {β(0)}={β₀(0), β₁(0), . . . , β_(length)(0)}/{B(0)}={B₀(0), B₁(0), . . . , B_(length)(0)} are not required to be stored in the state metrics memory. This technique is called embedded state metric storage (EMSM). It can reduce the memory requirement by a factor of 1/M.

Since α_(k)(0)/A_(k)(0) and β_(k)(0)/B_(k)(0) are always constant, one for a MAP decoder and zero for a log-MAP decoder after the embedding procedure, the calculation using the embedded metric as an input signal can be omitted during state metrics updating and extrinsic information calculation. Thus, EMSM will not increase the computational load. The contribution of the embedded metrics during the state metrics updating and extrinsic information calculation is provided by the embedded relationship among the other state metrics, and EMSM can achieve the same result with known methods.

FIG. 4 is a simplified schematic block diagram of representative, illustrative interconnection of resources detailing the operations of MAP decoder and log-MAP decoder using the present invention. The received data (ds,dp,dp′) 402 and the extrinsic information of the previous iteration LLe_in 408 are fed into the branch metric calculation unit 403 from the received data memory 401 and the memory of the extrinsic information of the previous iteration 414, respectively, to generate the branch metric γ_(k) ^(e) 404 which is applied to the likelihood ratio calculation unit 405, which produces at its output LLe_(k) 406 likelihood ratio calculation. The branch metric calculation unit 403 also produces the branch γ_(kf) 410 for the state metrics updating in the forward direction and the branch metric γ_(kr) 409 for the state metrics updating in the reverse direction.

The embedded state metrics-updating unit in the forward direction 412, in which multiplication for MAP algorithm or addition for log-MAP algorithm with the embedded metric are omitted, uses the branch metric γ_(kf) 410 from the branch metric calculation unit 403 and the state metric in the previous step α_(kf) 419 from the metric buffer/memory 416 to update the state metric α_(kf+1) 418 in the forward direction. The embedded state metrics-updating unit in the reverse direction 411, in which multiplication for MAP algorithm or addition for log-MAP algorithm with the embedded metric is omitted, uses the branch metric γ_(kr) 409 from the branch metric calculation unit 403 and the state metric in the previous step β_(kr) 415 from the metric buffer/memory 416 to update the state metric β_(kr−1) 417 in the reverse direction. The embedded metric is not stored in the state metric buffer/memory 416.

Updating of state metrics is preferably done in accordance with the known sliding window (SW) technique in practical systems to reduce the memory requirement. As mentioned in the background section, all of the metric information of the forward and reverse directions must be temporarily stored to calculate the extrinsic information. If the data frame length is large, the temporary memory will be huge. In practical systems, only part of the frame, called a window, instead of the whole frame is used to compute the extrinsic information, ignoring some of the extrinsic information, thus causing performance degradation. However, with a window sliding from the head to the end of the frame, all of the extrinsic information is obtained. The sliding window works as follows:

-   a) The window starts at the end of the data frame, let step number     k=0, source symbol frame length=L, window size=W and training     size=T; -   b) Perform forward metric update from the k^(th) source symbol     information, save the forward metric from k to k+W−1; if ((k+W+T)>L,     then perform forward metric update from k^(th) source symbol     information, save the forward metric from k to L−1; -   c) Perform reverse metric update from the (k+W+T)^(th) source symbol     information, save the forward metric from k+W to k+1; if ((k+W+T)>L,     then perform reverse metric update from the (L)^(th) source symbol     information, save the forward metric from L to k+1; -   d) Calculate extrinsic information from k to the (k+w−1) for source     symbols; if (k+W+T)>L, then calculate extrinsic information from k     to the (L−1) for source symbols; -   e) Update k=k+W; if k<L then go to step b; otherwise this iteration     terminates.     Thus the temporary memory depth reduces from 2 frame lengths to 2     window sizes. In generally, the training size should at least be 3     or 4 times of the state size. EMSM can help the SW technique to     reduce metric memory requirement in the same way as mentioned in the     previous paragraph.

When the branch metric γ_(k) ^(e) 404 and the state metrics (α_(k−1),β_(k)) 413 in both recursion directions are available at the input of the likelihood ratio calculation unit 405, the embedded likelihood ratio LLe_(k) 406, in which multiplication for the MAP algorithm or addition for the log-MAP algorithm with the embedded metric is omitted, is generated and stored in the extrinsic information buffer/memory for the next iteration.

FIG. 5 shows the performance comparison curves 501 comparing the log-MAP decoder in accordance with the present invention and the known method on an AWGN channel. The turbo code used in this performance analysis is based on the cdma2000 standard with Length=1146 and code rate=1/3. The log-MAP decoder with EMSM 502 can achieve the same performance with the known method 503, and EMSM can reduce the memory used by 12.5% compared to the known method. In FIG. 5 we find that there is almost no difference between the two curves, indicating that EMSM can achieve the same decoding performance as the traditional decoding scheme.

FIG. 6 shows the performance comparison curves 601 comparing the log-MAP decoder in accordance with the present invention and the known method on an AWGN channel. The turbo code used in this performance analysis is based on the cdma2000 standard with Length=3066 and code rate=1/3. The log-MAP decoder with EMSM 602 can achieve the same performance with the known method 603, and EMSM can reduce the memory used by 12.5% compared to the known method. Again, we find that there is almost no difference between the two curves, indicating that EMSM can achieve the same decoding performance as the traditional decoding scheme.

By virtue of the above techniques, a MAP decoder can be produced that has performance nearly identical to the conventional MAP decoder, yet which requires much less memory. The architecture shown in FIG. 4 can be used to construct the MAP decoder, which is implemented in an FPGA, PLC or ASIC with EMSM technique.

The present invention has been described above in connection with certain illustrative embodiments. However, the present invention is in no way limited to the disclosed embodiment, which are exemplary and not intended to limit the scope of the invention, which is to be interpreted in accordance with the appended claims. 

1. A method of state metrics storage in a turbo decoder applicable to various decoding algorithms, comprising: (a) updating the state metrics using a sliding window method; (b) selecting an embedded metric from the updated ones of the state metrics in forward or reverse recursion directions; (c) performing a different calculation on said updated state metrics and said embedded metric for respective different ones of said decoding algorithms; when the forward and reverse state metrics are available, performing extrinsic information calculation; (d) saving said embedded metric in a state metric buffer; (e) saving a result of said calculation at step (c) for use as an input signal during state metrics updating and extrinsic information calculation.
 2. The method according to claim 1, wherein a state metric is selected as said embedded metric.
 3. The method as in claim 1, wherein when the algorithm is the MAP algorithm, said calculation involves dividing said updated state metrics by one.
 4. The method as in claim 1, wherein when the algorithm is the log-MAP algorithm, said calculation involves subtracting said updated state metrics from zero.
 5. The method as in claim 1, wherein when the algorithm is the Max log-MAP algorithm, said calculation involves subtracting he said updated state metrics from zero.
 6. An apparatus for implementing state metrics storage in a turbo decoder applicable to various decoding algorithms, comprising: means for updating the state metrics using a sliding window method; means for selecting an embedded metric from updated state metrics in forward or reverse recursion directions; means for performing a different calculation on said updated state metrics and said embedded metric for respective different ones of said decoding algorithms; means for performing a different calculation on said updated extrinsic information with metrics metric for respective different ones of said decoding algorithms; means for saving said embedded metric in a state metric buffer; means for saving a result of said calculation for use as an input signal during state metrics updating and extrinsic information calculation.
 7. The apparatus according to claim 6, wherein the apparatus is realized with a programmable logic device (PLD).
 8. The apparatus according to claim 6, wherein the apparatus is realized with a field-programmable gate array (FPGA).
 9. The apparatus according to claim 6, wherein the apparatus is realized with application-specific integrated circuit (ASICs).
 10. The apparatus according to claim 6, wherein when the algorithm is the MAP algorithm, said calculation involves dividing said updated state metrics by one.
 11. The apparatus according to claim 6, wherein when the algorithm is the log-Map algorithm, said calculation involves subtracting said updated state metrics from zero.
 12. The apparatus according to claim 6, wherein when the algorithm is the Max log-MAP algorithm, said calculation involves subtracting said updated state metrics from zero. 